Accession Number |
TCMCG075C10923 |
gbkey |
CDS |
Protein Id |
XP_017973432.1 |
Location |
complement(join(32524850..32524929,32527624..32527777,32528139..32528261,32528338..32528452,32528587..32528685,32528762..32528814,32529428..32529584,32530044..32530102,32530270..32530337,32530423..32530566,32531239..32531302,32532079..32532129,32532232..32532371,32532480..32532619,32533059..32533144,32533231..32533376,32533502..32533574,32534300..32534375,32534615..32534740,32534874..32534920,32535143..32535188,32535998..32536110,32536207..32536236)) |
Gene |
LOC18606235 |
GeneID |
18606235 |
Organism |
Theobroma cacao |
|
|
Length |
729aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018117943.1 |
Definition |
PREDICTED: DNA mismatch repair protein MSH4 isoform X8 [Theobroma cacao] |
CDS: ATGGCTCGTGGTTGCTTTGATGACACCAAGGGTGCAATGCTGATTAAAAATTTAGCTGTCAGAGAGCCTTCAGCCCTTGGTTTGGATAGTTACTACAAACAGTATTATCTTTGCTTGGCTTCTGCTTCTGCTACAATCAAATGGATAGAAGCAGAGAAAGGTGTTATTGTCACAAATCATTCCTTATCGGTTACTTTTAATGGATCATTTGACCACATGAACATTGATGCTACTAGTTTGATTGTGGATGGCAGTGTCCAAAACTTAGAAATTATTGAACCTTTTCATTCTGCACTTTGGGGCACAAACAACAAGAAAAGAAGTCTATTCCACATGCTTAAGACAACAAAAACTGTTGGAGGGACTAGACTTCTTCGTGCCAATCTTTTGCAGCCTTTAAAAGATATCGAAACTATCAATACGCGTCTGGATTGCCTGGATGAGTTGATGAGCAATGAACAGCTATTCTTTGGACTGTCTCAGGTCTTGCGAAAGTTCCCAAAGGAGACTGATAGGGTACTTTGTCATTTCTGCTTCAAGCCAAAGAAAGTAACAAATGAAGTCTTGGTTGTGGAAAACACTAGAAAGAGCCAAATGCTGATATCAAGCATCATTCTTCTCAAAACTGCATTAGATGCCTTGCCGTTACTATCAAAGGTGCTTAAGGATGCAAAAAGTTTTCTTCTTGCAAATGTTTACAAGTCTATATGTGAAAACGAGAAATATGCTGACATTAGAAAGAGAATTGGAGTGGTGATTGATGAAGATGTGCTTCACGCACGGGTTCCTTTTGTTGCCCGCACACAGCAGTGTTTTGCTGTCAAGGCTGGCATTGATGGGCTATTGGATATAGCTCGGAGATCTTTTTGTGATACCAGCGAAGCTATACATAACCTTGCAAACAAGTACCGGGAAGAATTCAAGATGCCGAATCTGAAACTCCCATTTAACAGTAGACAAGGTTTTTACTTTAGCATTCCACAGAAAGACATTCAGGGACAGCTTCCCAGCAAGTTCATTCAGGTTGTGAAACATGGGAATAATGTACATTGTTCAACTTTGGAACTTGCTTCTCTGAATGTCAGAAATAAATCTGCGGCTGGAGAGTGTTATATACGAACAGAAGTTTGCTTGGAAGCCCTAGTTGATACCATAAGGGAGGATATCTCTGTGCTCACACTGCTTGCTGAAGTCCTGTGCCTGTTAGATATGATTGTTAATTCATTTTCTCATACAATATCAACCAAGCCTGTTGACCGATATATTAGGCCAGAATTTACTGATGATGGCCCTCTGGCAATTGATGCTGGTAGACACCCCATCCTAGAAAGCATACACTGTGATTTTGTGCCCAACAACATCTTTATTTCAGAAGCATCAAACATGGTTATTGCAATGGGGCCAAACATGAGCGGGAAGAGCACTTATCTTCAACAAGTGTGTCTCATAGTTATTCTTGCTCAGATTGGTTGCTATGTTCCTGCCCGCTTTGCAACAATTAGAGTAGTTGATCGTATATTTACAAGGATGGGCACAATGGATAATCTTGAATCAAACTCTAGTACGTTTATGACAGAGATGAAAGAGACTGCTTTTGTCATGCAGAATGTCTCCCAAAGGAGTCTGATTGTTATGGATGAACTTGGGAGGGCTACTTCGTCCTCTGATGGATTGGCAATAGCATGGAGCTGCTGTGAACATCTGCTATCACTCACTGCGTATACCATATTTGCTACTCATATGGAGAACTTGTCAGAATTAGCTACCATCTATCCAAATGTGAAAATTCTTCGCTTCGATGTTGATATTAGAAACAGCCGCCTAGATTTTAAGTTTCAACTCAAGGATGGACCAAGGCATGTAGCACACTATGGCCTTCTACTAGCAGAAGTGGCAGGATTACCGAGTTCGGTGATTGAAACAGCCAGAAGCATAACATCAAGGATTACAGACAAGGAAGTGAAGCGAATGGATGTAAACTGCCTGCACTATAAT
CAAATACAGTTGGCATATCATGTTTCTCAACGACTGATATGCTTGAAGTACTCCAACCATGACGAGGACTCCATCCGGCAGGCATTGCAAAGTCTCAAAGAGAGCTACATTGATGTGTGGGGGAATTTTGGAATCAAACTTGATCAGTCATCAGAGGGATGCGGTAAAACTTCGGCCCAAAGAATTATCGAATGA |
Protein: MARGCFDDTKGAMLIKNLAVREPSALGLDSYYKQYYLCLASASATIKWIEAEKGVIVTNHSLSVTFNGSFDHMNIDATSLIVDGSVQNLEIIEPFHSALWGTNNKKRSLFHMLKTTKTVGGTRLLRANLLQPLKDIETINTRLDCLDELMSNEQLFFGLSQVLRKFPKETDRVLCHFCFKPKKVTNEVLVVENTRKSQMLISSIILLKTALDALPLLSKVLKDAKSFLLANVYKSICENEKYADIRKRIGVVIDEDVLHARVPFVARTQQCFAVKAGIDGLLDIARRSFCDTSEAIHNLANKYREEFKMPNLKLPFNSRQGFYFSIPQKDIQGQLPSKFIQVVKHGNNVHCSTLELASLNVRNKSAAGECYIRTEVCLEALVDTIREDISVLTLLAEVLCLLDMIVNSFSHTISTKPVDRYIRPEFTDDGPLAIDAGRHPILESIHCDFVPNNIFISEASNMVIAMGPNMSGKSTYLQQVCLIVILAQIGCYVPARFATIRVVDRIFTRMGTMDNLESNSSTFMTEMKETAFVMQNVSQRSLIVMDELGRATSSSDGLAIAWSCCEHLLSLTAYTIFATHMENLSELATIYPNVKILRFDVDIRNSRLDFKFQLKDGPRHVAHYGLLLAEVAGLPSSVIETARSITSRITDKEVKRMDVNCLHYNQIQLAYHVSQRLICLKYSNHDEDSIRQALQSLKESYIDVWGNFGIKLDQSSEGCGKTSAQRIIE |
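
The Location field in this record uses GenBank feature-location syntax: `join(...)` lists the exon segments as 1-based inclusive coordinate ranges, and `complement(...)` marks the feature as lying on the minus strand. As a consistency check, the segment lengths should add up to the CDS length, which for a 729 aa protein plus one stop codon would be 3 × 730 = 2190 nt. The following is a minimal Python sketch of that check, assuming simple regex-based parsing; the function names `parse_segments` and `cds_length` are illustrative and not part of any database tooling.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(32524850..32524929,32527624..32527777,"
    "32528139..32528261,32528338..32528452,32528587..32528685,"
    "32528762..32528814,32529428..32529584,32530044..32530102,"
    "32530270..32530337,32530423..32530566,32531239..32531302,"
    "32532079..32532129,32532232..32532371,32532480..32532619,"
    "32533059..32533144,32533231..32533376,32533502..32533574,"
    "32534300..32534375,32534615..32534740,32534874..32534920,"
    "32535143..32535188,32535998..32536110,32536207..32536236))"
)

PROTEIN_LENGTH_AA = 729  # "Length" field of the record


def parse_segments(location):
    """Return (start, end) pairs, 1-based inclusive, in the order listed."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(location):
    """Sum the lengths of all segments in a join(...) location."""
    return sum(end - start + 1 for start, end in parse_segments(location))


segments = parse_segments(LOCATION)
total = cds_length(LOCATION)
on_minus_strand = LOCATION.startswith("complement(")

# Expected CDS length: one codon per residue plus one stop codon.
expected = 3 * (PROTEIN_LENGTH_AA + 1)

print(len(segments), total, expected, on_minus_strand)
# prints: 23 2190 2190 True
```

The sketch only handles plain `start..end` ranges; partial-location markers such as `<` or `>` do not occur in this record and are not covered.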